# Hierarchical feature extraction
MambaVision L3 256 21K
Other
MambaVision is the first hybrid computer vision model to combine the strengths of Mamba and Transformer. It makes visual feature modeling more efficient by redesigning the Mamba formulation, and it adds self-attention modules in the final layers of the Mamba architecture to improve long-range spatial dependency modeling.
Image Classification
Transformers

nvidia
510
7
MambaVision L2 1K
Other
MambaVision is the first hybrid computer vision model to combine the strengths of Mamba and Transformer. It improves visual feature modeling by redesigning the Mamba formulation, and it adds self-attention modules in the final layers of the Mamba architecture to improve long-range spatial dependency modeling.
Image Classification
Transformers

nvidia
56
13
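
The MambaVision descriptions above place self-attention in the final layers of a Mamba stack. A minimal sketch of such a layer layout follows. It assumes an even split between Mamba-style mixer layers and self-attention layers; the split ratio and the `hybrid_layer_pattern` helper are illustrative assumptions, not details given in this listing.

```python
def hybrid_layer_pattern(depth):
    """Return a per-layer plan for one hybrid stage.

    Assumption (not stated in the listing): the last half of the layers
    use self-attention and the remaining earlier layers use a Mamba-style
    mixer. With an odd depth, the extra layer goes to the Mamba side.
    """
    if depth < 1:
        raise ValueError("depth must be at least 1")
    num_attention = depth // 2
    num_mamba = depth - num_attention
    # Mamba-style layers come first; attention layers close the stage.
    return ["mamba"] * num_mamba + ["attention"] * num_attention


if __name__ == "__main__":
    for index, kind in enumerate(hybrid_layer_pattern(8)):
        print(index, kind)
```

Putting attention last means the global token mixing operates on features that the sequence-style layers have already processed, which matches the stated goal of improving long-range spatial dependency modeling.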
Swinv2 Small Patch4 Window8 256
Apache-2.0
Swin Transformer v2 is a vision Transformer model that achieves efficient image processing through hierarchical feature maps and local window self-attention mechanisms.
Image Classification
Transformers

microsoft
1,836
0
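
The Swin Transformer v2 entry above mentions hierarchical feature maps and local window self-attention. The sketch below illustrates both ideas in plain Python, using the patch size (4), window size (8), and image size (256) from the model name. The four-stage layout with the token grid halving at each stage is an assumption, and `stage_resolutions` and `partition_windows` are hypothetical helpers, not part of any library.

```python
def stage_resolutions(image_size=256, patch_size=4, num_stages=4):
    """Side length of the token grid at each stage.

    Assumption: every stage after the first halves the grid side,
    which is what produces a hierarchy of feature maps.
    """
    side = image_size // patch_size
    return [side // (2 ** stage) for stage in range(num_stages)]


def partition_windows(height, width, window):
    """Group token coordinates into non-overlapping window x window blocks."""
    if height % window or width % window:
        raise ValueError("grid size must be divisible by the window size")
    windows = []
    for top in range(0, height, window):
        for left in range(0, width, window):
            windows.append([
                (row, col)
                for row in range(top, top + window)
                for col in range(left, left + window)
            ])
    return windows


def attention_pairs(windows):
    """Count query-key pairs when attention is restricted to each window."""
    return sum(len(tokens) ** 2 for tokens in windows)


if __name__ == "__main__":
    grids = stage_resolutions()
    first = partition_windows(grids[0], grids[0], window=8)
    global_pairs = (grids[0] * grids[0]) ** 2
    print("grid sides per stage:", grids)
    print("windows in first stage:", len(first))
    print("windowed vs global pairs:", attention_pairs(first), global_pairs)
```

Restricting attention to windows makes the number of query-key pairs grow linearly with the number of tokens rather than quadratically, which is the efficiency the description refers to.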